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CLAIMS 

1 . A method for comparing a first content with a second content to determine 
whether the contents are identical, the method comprising the steps of: 

identifying a protocol encoding the first content and second content; 
computing a first signature of the first content and a second signature of the sec- 
ond content; and 

comparing the first computed signature with the second signature to determine 
whether the first content is identical to the second content. 

2. The method of claim 1 further comprising the steps of: 

selecting a first set of data segments from the first content and a second set of data 
segments from the second content; and 

using the selected first set of data segments and the second set of data segments to 
compute the first signature and the second signature. 

3. The method of claim 2 wherein the selected first set of data segments and second 
set of data segments comprise locations associated with one or more protocol markers. 

4. The method of claim 1 wherein the step of computing the signature of the first 
content and the signature of the second content further comprises the steps of: 

identifying one or more protocol markers associated with the first content; and 
identifying one or more protocol markers associated with the second content. 

5. The method of claim 4 wherein the one or more protocol markers associated with 
the first content comprises discrete cosine coefficients. 

6. The method of claim 4 wherein the one or more protocol markers associated with 
the second content comprises discrete cosine coefficients. 
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7. The method of claim 4 wherein the one or more protocol markers associated with 
the first content comprises motion vectors. 

8. The method of claim 4 wherein the one or more protocol markers associated with 
the second content comprises motion vectors. 

9. The method of claim 4 further comprising the steps of: 
identifying a length of the first content; and 
identifying a length of the second content. 

10. A content comparator adapted to compare first content with a second content, the 
comparator comprising: 

a protocol identification module configured to identify a first protocol associated 
with the first content and a second protocol associated with the second content; 

a plurality of data segmentation modules configured to select a set of data seg- 
ments from each of the first content and the second content; 

a plurality of signature computation modules configured to generate a first signa- 
ture of the first content and a second signature of the second content; and 

a signature comparison module configured to compare the first signature with the 
second signature. 

11. An apparatus for comparing a first content with a second content, the apparatus 
comprising: 

means for identifying a protocol encoding the first content and the second content; 
means for selecting a set of data segments from the first content and the second 
content; 

means for computing a signature of the first content and a signature of the second 
content; and 

means for comparing the computed signature of the first content with the com- 
puted signature of the second content. 
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12. The apparatus of claim 1 1 wherein the selected data segments comprises locations 
associated with one or more protocol markers. 

13. The apparatus of claim 1 1 wherein the means for computing the signature of the 
first content and the signature of the second content further comprises: 

means for identifying one or more protocol markers associated with the first con- 
tent; and 

means for identifying one or more protocol markers associated with the second 
content. 

14. The apparatus of claim 13 wherein the one or more protocol markers associated 
with the first content comprises discrete cosine coefficients. 

15. The apparatus of claim 1 3 wherein the one or more protocol markers associated 
with the second content comprises discrete cosine coefficients. 

16. The apparatus of claim 13 wherein the one or more protocol markers associated 
with the first content comprises motion vectors. 

1 7. The apparatus of claim 1 3 wherein the one or more protocol markers associated 
with the second content comprises motion vectors. 

18. The apparatus of claim 1 3 further comprises: 
means for identifying a length of the first content; and 
means for identifying a length of the second content. 

19. A method to compare a first content with a second content in a network storage 
environment, the method comprising the steps of: 

receiving the first content; 

computing a signature of the first content; 
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comparing the computed signature of the first content with a signature of the second 
content; and 

identifying, if the computed signature of the first content matches the signature of the 
second content, that the first content is identical to the second content. 

20. The method of claim 1 9 wherein the step of computing the signature of the first 
further comprises the steps of: 

identifying a set of protocol markers associated with the content; and 
generating the signature from the identified set of protocol markers. 

2 1 . The method of claim 1 9 wherein the set of protocol markers further comprise a set 
of discrete cosine coefficients. 

22. The method of claim 19 wherein the set of protocol markers further comprises 
one or more motion vectors. 

23. The method of claim 1 9 wherein a size of the received content is utilized in cre- 
ating the signature. 

24. A method for identifying content using a protocol associated with the content as a 
signature, the method comprising the steps of: 

determining the protocol associated with the content; 

identifying a set of markers associated with the protocol; 

obtaining a set of markers from the content using the set of marker associated 

with the protocol; and 

generating a signature of the content using the identified markers. 

25. The method of claim 24 wherein the identified markers are within a subset of the 
entire content. 
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26. The method of claim 24 wherein a size associated with the content is utilized to 
uniquely identify the content. 

27. The method of claim 24 wherein the signature is utilized in a network caching de- 
vice to determine whether data should be forwarded from the network caching device. 

28. The method of claim 24 wherein the signature is utilized to determine if a local 
copy of the content should be accessed. 

29. The method of claim 24 wherein the signature is utilized to determine if a remote 
copy of the content should be accessed. 

30. A protocol maker identifier for generating a signature of a content comprising: 
a data segmentation module configured to select a set of data segments from the 

content; and 

a signature computation module configured to generate the signature from the set 
of data segments. 

31. The protocol maker identifier of claim 30 further comprising a protocol identifi- 
cation module configured to identify a protocol associated with the content. 

32. The protocol maker identifier of claim 30 wherein the signature comprises a set of 
protocol markers. 

33. The protocol marker identifier of claim 32 wherein the set of protocol markers 
comprises a set of discrete cosine transform coefficients. 

34. A network caching device adapted to utilize a signature associated with a protocol 
for caching decisions, the network caching device comprising: 

means for determining a protocol of new contents; 
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means for computing a signature of the content; and 

means for comparing the computed signature of the new content with a signature 
of other content. 

35. The network caching device of claim 34 wherein the means for computing a sig- 
nature further comprises: 

means for identifying a set of makers associated with the protocol associated with 
the content; and 

means for obtaining appropriate markers associated with the content. 
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